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We show that qualitatively different epidemic-like processes from 
distinct societal domains (finance, social and commercial block- 
busters, epidemiology) can be quantitatively understood using 
the same unifying conceptual framework taking into account the 
interplay between the timescales of the grouping and fragmenta- 
tion of social groups together with typical epidemic transmission 
processes. Different domain-specific empirical infection profiles, 
featuring multiple resurgences and abnormal decay times, are re- 
produced simply by varying the timescales for group formation 
and individual transmission. Our model emphasizes the need to 
account for the dynamic evolution of multi-connected networks. 
Our results reveal a new minimally-invasive dynamical method for 
controlling such outbreaks, help fill a gap in existing epidemio- 
logical theory, and offer a new understanding of complex system 
response functions. 

Complex systems | Modern epidemiology | Group dynamics 

The world recently witnessed a baffling variety of global outbreak 
phenomena: the huge fluctuations across the world's financial mar- 
kets, driven in part by the rapid global spread of rumors 1 1 1; an un- 
expected global outbreak of swine flu['2|, driven in part by rapid so- 
cial mixing (e.g. within schools (3] 01); and even the sudden rise to 
global fame of an unknown Scottish singer, driven in part by word- 
of-mouth sharing|5 6 7|. To understand these phenomena, consider 
the following. The number and identity of the people with whom 
we are each in instantaneous electronic or physical contact - and 
with whom we can exchange information, rumors or viruses - are 
characterized by a strong persistence related to our social network of 
acquaintances, coexisting with large intermittent fluctuations arising 
from random interactions with various social groups. We are inter- 
ested in how the latter may give rise to novel types of dynamics, 
which are unexplained by standard epidemic models. Think for in- 
stance of airborne travel in which people remain confined for hours 
with strangers, unknowingly exchanging respiratory pathogens. On 
the blogosphere and on the Web, ephemeral groups form around top- 
ics or content and exchange information, opinions and social contacts 
before flickering out of existence. The transient transnational nature 
of online discussion groups and chat-rooms, as frequented by finan- 
cial traders or YouTube users (T]|6l|7) provides a vivid illustration. 

Before dwelling more on real- world examples and illustrating 
how they might reveal these novel dynamical properties, we first 
describe our model which combines group and individual dynam- 
ics. Current epidemiological models have been hugely successful 
in describing various biological diseases, and in incorporating many 



realistic details (e.g. spatial topology, differential susceptibility) 
El E3 mHIl D31 CCD El- Here, we study the poorly understood EQ 
dynamical regime shown in Fig. 1, where the group-level dynamics 
and individual-level transmission processes (modeled by SIR dynam- 
ics) evolve on similar timescales - and hence the number and identity 
of a given individual's contacts can change abruptly over time. In our 
model (Fig. lb), individual connectivities may change significantly 
on the same timescale as that of the SIR process, thereby mimicking 
those individuals such as participating in YouTube viewing, finan- 
cial systems, and schools, who may exhibit sometimes rapid moves 
among peer groups either online or in real space, while simultane- 
ously picking up and spreading rumors or pathogens. 

The grouping process at a given timestep involves coalescence 
and fragmentation events. In the simplest implementation, the rate of 
coalescence of two groups of size m and ri2 respectively is propor- 
tional to the combinatorial number of pairwise encounters between 
individuals, one from each group, i.e., the rate of coalescence is equal 
to Vcoai • m x 712, where v coa i < 1 quantifies the rate of group 
coalescence per pairs of individuals. Similarly, a given group of n 
individuals may fragment with a total rate equal to z/f rag • n, where 
Vfrag + ^coai < 1, reflecting the increasing fragility of large groups 
(standard size effect). 

A discrete illustration over six time steps is shown in Fig. lb 
and lc. Fig. Id contrasts the short-time group structure between 
individuals with the long-term linkage between them: as time in- 
creases without bound, by ergodicity, all individuals will have even- 
tually been part of some common group. While the latter long-time 
network structure is the one usually emphasized in models of epi- 
demic processes on complex networks, the short-time limited linkage 
is essential to understand the competition between individual isola- 
tion (which tends to stop an epidemic) and group coalescence which 
amplifies its spreading. Our model provides a simple framework to 
quantify the interaction of these group dynamical processes in com- 
bination with the SIR dynamics (with p and q as the infectivity and 
recovery parameters). 

We chose this specific fission-fusion process for several reasons: 
(1) It embodies the rare but dramatic changes of contact networks that 
can occur, as mentioned in the introduction. (2) It produces a distri- 
bution of group sizes which is power-law with exponent 5/2 when 
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time- averaged, as shown in a related model applied to financial mar- 
kets 1 15 1, in agreement with many empirical distributions across the 
natural and social sciences 1 16]. (3) The power-law exponent 5/2 is 
exactly that inferred for group sizes of financial traders, as well as 
terrorists and insurgent groups, based on an analysis of volume of 
trades and casualty figures respectively! 161 H71. (4) The model is 
structurally robust in that the group dynamic rules can be generalized 
to different positive power exponents a / 1,^ / 1,7 / 1, with 
coalescence and fragmentation rates given by v coa i • nf x and 
^frag • n 1 ', respectively, without losing the main qualitative features 
of the dynamics of the number I(t) of infected individuals. 

In the numerical implementation of the model, we run the above 
coalescence-fragmentation dynamics until the time-averaged distri- 
bution of group sizes has become stationary. Then, at some instant 
taken at the origin of time t = 0, one group is selected, and an arbi- 
trary individual in this group becomes infected, and hence the infec- 
tion profile unfolds according to the SIR process within each group, 
with all the groups undergoing at the same time the coalescence- 
fragmentation dynamics according to the two rates v coa i and ^f rag . 
Our model describes genuine within-group dynamics coexisting with 
between-group dynamics (our model is fundamentally different from 
a homogeneous-mixing metapopulation type model). 

A typical simulation is shown in Figure 2, and is compared with 
the popular approach that models spreading on static networks: (i) an 
instantaneous network (T = 1, purple curve) and (ii) a global net- 
work formed by time- aggregating instantaneous contacts over long 
times (T — > oo, green curve). SIR spreading dynamics on fixed net- 
works obtained at different intermediate T gives curves that lie in 
the shaded area of Figure 2. Our model can generate not only this 
type of dynamics, but also qualitatively new regimes that arise from 
adjusting the coalescence-fragmentation rates: the large fluctuations, 
resurgences, and abnormally long decay time which are observed in 
our model (and illustrated in Figure 2 (blue curve)) are generated by 
self- amplification and suppression processes due to the fission-fusion 
group dynamics at all group-size scales. 

How does this compare with real-life epidemic-like dy- 
namics? The top two rows of Figure 3 show two typi- 
cal example datasets that we have collected in each of the 
three domains mentioned above. (See below, and support- 
ing online material http : //www . er . ethz . ch/publi cat ions/ 
complex_systems/internet/group _dynamics_som.pdf ., for ex- 
perimental details). These three examples have been chosen to il- 
lustrate bursts of activity followed by slow intermittent relaxations, 
which can not be accounted for by standard SIR dynamics on fixed 
network topologies. The third row shows a realization of our model 
with sets of parameters chosen so that the main properties of the two 
first rows are qualitatively reproduced. These comparisons illustrate 
the power of our model to account for quantitatively different regimes 
in distinct domains within the same unifying conceptual framework 
taking into account the interplay between the timescales of the group- 
ing and transmission process. 

The left panel of Fig. 3 shows downloads for two simi- 
lar YouTube clips (6] [7). Such downloads are typically driven by 
YouTube users absorbing and spreading opinions as they share in- 
formation in their social groups Q. The two downloads appealed 
to similar age-groups, and were measured close together in time, 
implying that a similar pool of users accessed them both, in line 
with our model's assumptions. The fission-fusion group dynamics 
with SIR epidemic spreading accounts well for the long memory and 
aftershock-like decay. The middle panel shows foreign exchange 
movements as a result of a specific rumor spreading among traders 



concerning revaluation of the Chinese Yuan currency. This same ru- 
mor circulated twice in the space of a few months. The fact that 
the currency pairs follow a similar dynamical pattern in each case, 
suggests that the same underlying group dynamics developed, in line 
with our model. Note that this financial epidemic is characterized 
by the largest coalescence rate z/ coa i and much larger infectivity pa- 
rameter p among the three examples, reflecting the efficiency of the 
information cascade among currency traders. The right panel shows 
incidences of a cold among 1st grade students in two schools in Bo- 
gota, Colombia. The schools' location guarantees that seasonal tem- 
perature variations are minimal, and the student population of each 
approximates to a closed system due to local issues of security and 
social segregation. We argue that the immune systems of the chil- 
dren have been subjected to a soup of microbes coming and going, so 
that the successive bursts are part of the same dynamics, especially 
given the unchanging climatic conditions of this part of Columbia. 
Within our approach, the school cold dynamics are found to be best 
described by the lowest fragmentation rate z/f rag and highest recov- 
ery parameter q, mirroring the more rigid structure of inter-children 
contacts and the crucial role of multiple recuperations. 

A full analytic description of I(t) represents a fascinating open 
challenge. However some features can be captured by suitably gen- 
eralizing existing epidemiological machinery. A key quantity is the 
probability ?7si(t) that a particular link instantaneously exists and 
that it connects a susceptible and an infected. Thus, out of a po- 
tential totality of N(N — l)/2 links among N individuals, only 
r]si(t) • N(N — l)/2 are typically present. This provides an accu- 
rate equation for the number of susceptibles S(t) in a given epidemic 
sequence: S(t) = —p.rjsi(t).N(N — l)/2, with pis the infectivity 
parameter quantifying the probability that a particular S is infected 
by a particular / to which it is linked. We rewrite this in the conven- 
tional mass-action form S(t) = —pPsi(t)S(t)I(t), where Psi(t) is 
equivalent to rjsi(t)N(N — l)/[2S(t)I(t)] and hence now incorpo- 
rates the complex dynamics which are so hard to capture analytically. 
We now approximate Psi (£) by a constant term P, which is the time- 
averaged probability that any two arbitrarily chosen nodes belong to 
the same cluster independent of S7-infection status. This approxima- 
tion throws out the dynamical details of /(£), but can provide useful 
insights, as shown below on the non- spreading to spreading transi- 
tion, provided that several coalescence-fragmentation processes oc- 
cur over the timescale of the entire outbreak. We obtain P by solving 
a time-dependent master equation, yielding P = v coa i / (Nvf rag ) if 
the coalescence and fragmentation probabilities are of the same order 
and N ^> 1. This result can be derived independently using gener- 
ating function techniques on the coupled equations for groups of all 
sizes. 

We now use this result to address a highly topical, yet previously 
unaddressed, question: Will there be epidemic spreading in a popu- 
lation in which it is publically known that N persons have been in- 
fected with a given pathogen or rumor, but where the precise identity 
of infected persons cannot be disclosed? At t = 0, No <C ^in- 
dividuals of the instantaneously largest group are infected and news 
of an infection is announced without disclosing the infected's iden- 
tities. The population reacts by adjusting its group dynamics, i.e. 
it adopts a new z/ coa i and z^f rag . The number of susceptibles in the 
long-time limit S(oo) with N ^> 1 is then given by the solution 
z to the following generalized form, z = exp[— k(1 — z)] where 
z = S(oo)/N and k = pv CO a\/qv{rag- For k < 1, the only so- 
lution is z = 1, corresponding to a vanishingly small fraction of 
infected individuals (i.e. total number of infected R(oo) does not 
exceed No <C N). This solution bifurcates at k — 1 into the fol- 
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lowing stable solution z = —(1/k) -W (— k • e _K ) valid for k > 1, 
where W(z) is the Lambert function. For k > 1.5, z is very well- 
approximated by z « e~ K I n. This shows a rather abrupt transition 
from non- spreading epidemics for ft < 1 to global infection of a fi- 
nite fraction of the population for k > 1. The form of the epidemic 
control parameter k = p^coai / qWrag exemplifies that infectivity and 
coalescence play together against recovery and fragmentation in con- 
trolling the propagation of the epidemics: Infectivity and coalescence 
promote the infection propagation, while recovery and fragmentation 
hinders its spread. Our model provides a new window of opportunity 
for controlling epidemics, by providing a quantitative framework to 
better target actions at the group level through its fission-fusion dy- 
namics, thus improving the distinction with actions at the the more 
intrinsic level of the host- virus characteristics and dynamics (p, q). 

Figure 4 shows the resulting phase diagram as a function of the 
new zy C oai and ^f rag . Not only is our theory for the spreading thresh- 
old (dashed black line in Fig. 4) in excellent agreement with the 
numerical results (white solid line), its simple analytic form suggests 
a novel yet generic epidemic control scheme based on manipulation 
of the group coalescence and fragmentation timescales (i.e. z/^ and 
^frag)- ^ n i mmment epidemic can be suppressed (i.e. R(oo) < No) 
by increasing the timescale for group coalescence with respect to the 
timescale for group fragmentation (i.e. decrease zy coa i with respect 
to z^frag), but it will get amplified if we decrease the coalescence 
timescale with respect to the fragmentation timescale (i.e. increase 
^coai with respect to z/f rag ). Not only would such modest intervention 
allow the overall system to continue functioning, it does not require 
knowledge of the infected' s identities. There is also no assumption 
that the iVo members of the group which carries the initial infected 
case at t = 0, remain in that group. In the school setting, schedules 
could be adjusted to slow down or speed up classroom use and recess, 
without the need for disruptive school closures [4] or the need to test, 
label or isolate infected children. Similar control can be achieved in 
the online chatrooms frequented by financial traders, by basing the 
joining and leaving rules on present occupancy. In viral marketing, 
the attractiveness of the message or product quantified by the infec- 
tivity p can be completely subjugated by suitable management of the 
group dynamics (z^coai versus z/frag), as firms using e-commerce and 
e-advertisement are now realizing. These findings are potentially ap- 
plicable to many other scenarios, given that many real-world activ- 
ity/infection curves resemble those in Figure 2. Our framework of- 
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fers a theoretical underpinning to formulate new solutions as well as 
getting new sources of data probing the inner working of the interplay 
between within-group and between-group dynamics. 

Data Figure 3, left panel: 'Gettin' Enough' from http://www. 
youtube . com/watch? v=AiXxMrkeklg Video uploaded to site: 
Wed, 08 Nov 2006 13:33:12 GMT. First record of download: Thu, 
09 Nov 2006 17:21:35 GMT, view count: 5708. Last record: Thu, 
24 May 2007 21:34:30 GMT, view count: 257759. Length of 
video: 225 seconds. Middle-left panel, downloads of music video 
'Borat' from http : //www . youtube . com/watch?v=blxXERFt_Zg 
Video uploaded to site: Fri, 03 Nov 2006 11:04:15 GMT. First 
record of download: Tue, 07 Nov 2006 10:35:57 GMT, view count: 
20745. Last record: Thu, 24 May 2007 22:29:25 GMT, view count: 
254918. Length of video: 154 seconds. These two music down- 
loads are similar in terms of appeal, age-group, total number of 
downloads, and lack of any public/global announcement, news or 
advertisement, hence consistent with spreading through contagion. 
Bottom-left panel, model output with z/f rag = 0.05, z^coai = 0.81, 
p = 0.001, q = 0.001. Financial exchange rates for currency (cen- 
tre panels): Top-centre panel, CNY (Chinese currency) revaluation 
rumor, detected from trader chat-rooms by HSBC bank (courtesy of 
S. Williams). Absolute returns on the timescale of 1 -minute inter- 
vals, JPY (Japanese currency) exchange rates from 08:22 to 08:53 
GMT, on 11 May 2005. Middle-centre panel: CNY actual revalu- 
ation. Absolute returns on the timescale of 1 -minute intervals, JPY 
(Japanese currency) exchange rates from 1 1:03 to 1 1:34 GMT, on 21 
July 2005. Since the CNY was not one of the directly traded cur- 
rencies, its effects on the JPY-X rates (where X is another currency) 
are indirect in both cases, suggesting influence through contagion of 
the rumor/information. There was no public announcement or global 
news to trigger this activity, which also supports spreading through 
contagion. Bottom-centre panel: model output with z/f rag = 0.05, 
^coai = 0.95, p = 0.009, q = 0.002. Since data are on 1 -minute 
scale, but prices can change on 1 -second scale, we show an averaged 
output by providing value at regular equispaced intervals, mimick- 
ing 1 -minute. School colds data (right panels): Fraction of 1st grade 
children with colds in Marymount School and CNG School, Bogota, 
Colombia. Bottom-right panel: model output with z/f rag = 0.001, 
^coai = 0.5, p = 0.001, q = 0.004. The model output is smoothed, 
to mimic fact that data is recorded on the 1-week time interval. 
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Fig. 1 . Our dynamical group contagion model, a: Schematic of dynamical grouping of traders or YouTube users on the Internet b: Spreading in the presence of 
dynamical grouping via coalescence and fragmentation. Vertical axis shows number of groups of a given size at time t. c: Instantaneous network from Fig. 1 b at each 
timestep. d: Weighted network obtained by aggregating links over time-window T. 
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Fig. 2. Theoretical profile /(£). Blue curve shows our dynamical group contagion model, with ^f rag = 0.05, v coa \ = 0.95, p = 0.001 and q = 0.001. Using 
same p and q values, purple curve corresponds to stochastic SIR model on a static network with T — ► 1 (i.e. the t = network in Fig. 1 ). Green curve corresponds 
to stochastic SIR on a T — > oo network. 
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Fig. 3. Modern epidemics. Top two rows: Empirical activity profile I(t) in three distinct real-world systems. Third row: Results from dynamical grouping model. Left: 
YouTube download activity. Middle: Currency trading activity (i.e. absolute value of price-change, hence the excess demand to buy or sell at each timestep). Right: 
Fraction of children with colds within a school. Lower panel: Simple example of the repeated self-amplification and suppression processes which spontaneously arise 
within our the model. When replicated at all scales of group size, these processes generate a unified quantitative description of the empirical I(t) profiles. 
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Fig. 4. Consequences of human reaction to news of an outbreak. Phase diagrams show theoretically obtained transition (i.e. pUcoal = l, black dashed line) and 

the numerical result (white line) separating regimes of spreading (i.e. overall number of infecteds exceeds initial group size, hence R(oo) > No) and no-spreading 
(i.e. R(oo) < No). Population reacts to news of the initial infection at t = by changing its dynamical grouping from Vf rag = 0.001 and v coa \ = 0.99, to the new 
values shown on the axes. Colours show the population (in units of TVo) who become infected, and hence recovered, over the lifetime of the outbreak. Shaded grey 
region is unphysical since ^f rag + ^coai > 1- 
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